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Abstract 

This decade will see the first direct detections of gravitational waves 
by observatories such as Advanced LIGO and Virgo. Among the prime 
^ sources are coalescences of binary neutron stars and black holes, which 

are ideal probes of dynamical spacetime. This will herald a new era in the 
^ empirical study of gravitation. For the first time, we will have access to the 

{^JJ) genuinely strong-field dynamics, where low-energy imprints of quantum 

' ' gravity may well show up. In addition, we will be able to search for effects 

^ which might only make their presence known at large distance scales, such 

^ as the ones that gravitational waves must traverse in going from source 

I to observer. Finally, coalescing binaries can be used as cosmic distance 

markers, to study the large-scale structure and evolution of the Universe. 

With the advanced detector era fast approaching, concrete data anal- 
ysis algorithms are being developed to look for deviations from general 
' relativity in signals from coalescing binaries, taking into account the noisy 

detector output as well as the expectation that most sources will be near 
the threshold of detectability. Similarly, several practical methods have 
been proposed to use them for cosmology. We explain the state of the art, 
• • including the obstacles that still need to be overcome in order to make 

optimal use of the signals that will be detected. Although the emphasis 
will be on second-generation observatories, we will also discuss some of 
the science that could be done with future third-generation ground-based 
facilities such as Einstein Telescope, as well as with space-based detectors. 
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1 Introduction 

General relativity (GR) is a highly non-linear, dynamical theory of gravita- 
tion. Yet, until the 1970s, almost all tests of GR were theoretically based on 
the behavior of test particles in a static gravitational field T. These include 
the perihelium precession of Mercury, deflection of star light by the Sun, and 

*Chapter to appear in the Springer Handbook of Spacetime, Springer Verlag, 2013. 
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Shapiro time delay. The parameterized post-Newtonian (PPN) formahsm (for 
an overview, see [2]) was developed to provide a systematic framework for these 
and other checks, by appropriately parameterizing various aspects of spacetime 
geometry viewed as a non- linear superposition of contributions from e.g. the 
Sun and the planets. Even so, the most important early experiments did not 
require much more than an expansion of the Schwarzschild metric in GM / [c^r) 
up to the first few sub-leading terms, with M the mass and r the distance. 
Although excellent agreement with theory was obtained, the aspects of GR that 
were actually tested were somewhat limited, mostly amounting to the influence 
on the motion of test particles of low-order general-relativistic corrections to the 
Newtonian gravitational field. 

The situation improved with the discovery of the Hulse- Taylor binary neu- 
tron star in 1974 |3_, . One of the two stars can be observed electromagnetically 
as a pulsar, and from this signal it was inferred that the orbital motion of 
the binary changes as predicted by GR, assuming that orbital energy and an- 
gular momentum are being carried away by gravitational waves (GW). This 
was an event of historic significance, as it provided incontrovertible evidence of 
the dynamical nature of the gravitational field. Subsequently, similar and even 
more relativistic binary neutron stars were discovered, allowing for new tests 
of GR in a "post-Keplerian" framework [4]. Nevertheless, explaining the ob- 
served dissipative dynamics related to gravitational wave emission only requires 
a lowest-order approximation to GR in powers of w/c, with v a characteristic 
velocity. 

Some of the most exciting aspects of general relativity still remain out of our 
empirical reach. What we would like to explore is the full non-linear dynamics of 
the gravitational field itself, including its self-interaction. From this perspective, 
even the most relativistic binary neutron star system that is currently known, 
PSR J0737-3039 4 , is still in the relatively slowly varying, weak-field regime, 
with an orbital compactness of GM/{c^R) ~ 4.4 x 10~^ (where M is the total 
mass and R the orbital separation), and a typical orbital speed u/c ~ 2 x 10~^{^ 
By contrast, for a compact binary just prior to the final plunge and merger one 
has GM/{c^K) ~ 0.2 and v/c ~ 0.4. This will bring us to the genuinely strong- 
field, dynamical regime of general relativity, which in the foreseeable future will 
only be accessible by means of gravitational wave detectors. 

The first detection of gravitational waves by the Advanced LIGO and Virgo 
detectors might happen as early as 2015, and certainly before the end of the 
decade [S]. Around 2020, a network of five large interferometric GW detectors 
will be in place: in addition to the two Advanced LIGO interferometers [5] 
and Advanced Virgo [7], there will be the Japanese KAGRA [H], and IndlGO 
in India [H]. There is also the smaller GEO-HF in Germany [TU]. These are 
usually referred to as second-generation detectors. A conceptual design study 
for a third-generation observatory called Einstein Telescope (ET) was recently 
concluded [TT], and there are plans for a space-based observatory called LISA 
[T^ . There is a considerable body of literature on the projected capabihties of 
ET and LISA in probing the dynamics of gravity. Although attention will be 
given to these, our main focus in this chapter will be on what can be achieved 
with the upcoming advanced detectors. In particular, in the last few years, 

^For comparison, the surface gravity of the Sun is ~ 2 X 10~®, and the orbital speed of the 
Earth as it moves around the Sun is d/c ~ 10~^. 
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development has started of hands-on data analysis techniques for use on signals 
detected with second-generation observatories, properly taking into account the 
noisy detector output as well as the expectation that most sources will be near 
the threshold of detectability. 

Since the advent of GR, a large number of alternative theories of gravity 
have been proposed; for a partial list, see [T3]. Among these, GR tends to be 
the simplest and the most elegant; moreover, many of the alternatives are al- 
ready strongly constrained by existing observations. Consequently, our primary 
aim will not be to seek confirmation of an alternative theory and measure its 
parameters; rather, we want to develop a test of GR itself. The testing should 
be as generic as we can make it, in the sense that if macroscopic deviations 
from GR exist, we want to find them even if they take a form that is yet to be 
envisaged, rather than looking inside a class of particular, existing alternative 
theories. That said, we will occasionaly mention the predictions of such alter- 
native models to indicate the power of the probe that direct gravitational wave 
detection will provide us with. 

Recently proposed tests of the strong-field dynamics broadly fall into two 
categories. One consists of checking for possible alternative polarization states 
beyond the two polarizations that GR predicts, and which might only make 
their presence known in the case of gravitational waves that were generated 
in the ultra-relativistic regime. The other category focuses on the coalescence 
process of compact binary objects (neutron stars and black holes) [5]. 

Searching for alternative polarizations started in earnest with the detailed 
studies made on the electromagnetically observed binary neutron stars. Here 
we will explain how one would go about looking for their signature in data from 
gravitational wave detectors, in particular using transient signals such as will be 
emitted by supernova explosions or, again, coalescing compact binaries. There 
have also been studies about how to use stochastic gravitational waves for this 
purpose [H] ; these could take the form of a primordial GW background, or they 
could be a "bath" of radiation caused by a large number of unresolvable astro- 
physical sources, such as the combined population of all compact binary coales- 
cence events, or cosmic string cusps. Due to space limitations, here we will limit 
ourselves to resolvable transient sources. Although with a single interferometric 
detector one would not be able to tell the difference between, or even measure, 
additional polarizations, this does become possible with a network of detectors. 
In particular, it is possible to combine the outputs of multiple interferometers to 
construct a so-called null stream, which should be devoid of signals if the only 
polarization states present are the ones predicted by GR. More generally, one 
can have null streams which in addition to the usual tensor polarizations also 
exclude one or more of the alternative ones, allowing one to tell them apart. 
Here we will partially follow the recent discussions by Chatziioannou, Yunes, 
and Cornish [15], and by Hayama and Nishizawa [16]. 

The coalescence of compact binaries consists of three regimes: an adiabatic 
inspiral, the merger leading to the formation of a single black hole (or an ex- 
otic alternative object!), and the ringdown of the resulting object as it evolves 
towards a quiescent state. The inspiral regime is reasonably well understood 
thanks to the so-called post-Newtonian formalism |17j , in which physical quan- 
tities such as energy and flux are expanded in powers of u/c. A test of GR 
could then take the form of identifying directly measurable quantities, such as 
post-Newtonian coefficients in an expansion of the orbital phase, which in GR 
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are inter-related, and checking whether the predicted relationships really hold. 
This would constitute a very generic test of GR, in which no recourse needs to 
be taken to any particular alternative theory of gravity. Such a test was first 
proposed by Arun et al. [18]. Next, Yunes and Pretorius developed the pa- 
rameterized post-Einsteinian (ppE) framework, which considerably generalized 
the family of waveforms used in |18| by allowing for a larger class of parame- 
terized deformations [TSl [12] . The basic idea of Arun et al. was implemented 
in a Bayesian way by Li et al. using waveforms in the ppE family |201 121j . 
The latter approach focuses on hypothesis testing. This has the advantage that 
since for every detected sources the same "yes/no" question is asked, evidence 
for or against GR has a tendency to build up as information from an increasing 
number of detections is included. 

Moving beyond the inspiral regime, the ringdown can be studied in a variety 
of ways. In particular, the no hair theorem can be tested, which says that in GR, 
the spacetime around a quiescent, electrically neutral black hole is determined 
uniquely by its mass and spin j^H 1^3] . The ringdown process can be modeled as 
perturbations on a fixed black hole spacetime, and the Einstein field equations 
impose relationships between the ringdown frequencies and damping times of 
the various modes that can get excited [24]. Verifying these inter-dependences 
amounts to testing the no hair theorem. Moreover, it has been shown that the 
amplitudes of the ringdown modes carry information about the masses and the 
spins of the binary compact object that merged to form a single black hole [23]. 
As shown earlier by Ryan, the no hair theorem can also be tested by monitoring 
the motion of a small object (a neutron star or a stellar mass black hole) around 
a very massive black hole or exotic object, effectively mapping out the latter's 
spacetime geometry [23]. 

As demonstrated by Schutz already in 1986, inspiraling and merging com- 
pact binaries can also be used as cosmic distances markers, or standard sirens, to 
probe the large-scale structure and evolution of the Universe [5S]. A variety of 
techniques have been proposed to exploit this fact using the second-generation 
detectors, ET, and space-based detectors. The second-generation observatories 
will mainly give us information about the Hubble constant Hq; however, they 
will do so in a way that is completely independent of existing measurements, 
and in particular does not require the so-called cosmic distance ladder, with its 
potentially unknown systematic errors at every rung [57| [23 [55] ■ In the case of 
ET and space-based detectors, it is also possible to study the matter content 
of the Universe [HUl [SB [3H [331 [31] • By now we know that the expansion of the 
Universe is accelerating [35) . which can be modeled heuristically by invoking a 
new substance called dark energy. An exciting prospect is probing the equa- 
tion of state of dark energy with gravitational waves, again in a way that is 
independent of conventional observations. 

This chapter is structured as follows. In Sec. [2] we discuss how one might 
look for alternative polarization states in transient GW signals, using a network 
of detectors. Next, we explain how the inspiral of compact binaries can be used 
to arrive at a very generic test of the strong- field dynamics of general relativity, 
including self-interaction (Sec. [s]). The emphasis will be on second-generation 
detectors, where most sources will be near the threshold of detectability. As we 
shall see, in the case of binary neutron stars, appropriate data analysis methods 
are already in place, which can be applied to raw data from the advanced 
detectors as soon as they become available. Binary black holes are dynamically 
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far richer, but they also pose formidable data analysis problems. A discussion of 
merger and ringdown, and tests of the no hair theorem, will naturally bring us 
to third-generation ground-based observatories as well as space-based detectors 
(Sec. El), and we will give an overview of what one might expect from them. 
In Sec. pj we will briefly recall the basics of modern cosmology, and investigate 
what gravitational wave observations might have to say about the evolution of 
the Universe. A summary and conclusions are given in Sec. [6] 

We will denote binary neutron stars by BNS, neutron star-black hole systems 
by NSBH, and binary black holes by BBH. The usual post-Newtonian notation 
will be employed, where "gPN order" with q integer or half-integer refers to 
O [(ti/c)^"^] beyond leading order in expansions in v/c. Unless stated otherwise, 
we use units such that G — c — 1. 



2 Alternative polarization states 

In the so-called transverse-traceless gauge, the metric perturbation only has spa- 
tial components, and for a signal propagating in the z direction in a coordinate 
system associated with unit vectors {ex, Cy, Sz), it can be brought in the form 

m 

hj^ ^h+e++hy,ef^, (1) 

with 

e+ = ex®ex-ey®ey, (2) 
e^^- = ex®ey + ey®ex- (3) 

/i+ and ft-x are the magnitudes of the independent "plus" and "cross" tensor 
polarizations, respectively. The response to a gravitational wave of an L-shaped 
interferometric detector is a linear combination of these: 

h{t)=F+h+[t)+FJi^{t). (4) 

The beam pattern functions F^, Fx depend on the sky position Q = {9,4') of 
the source 

F+ = ^(1 + cos2 6') cos20, (5) 

Fx = — cos 6 sin 2(j). (6) 

In metric theories of gravity, up to 6 degrees of freedom are allowed [2]; 
these are illustrated in Fig.[T] Other than the plus and cross polarizations, they 
include a scalar "breathing mode" ("b"), a scalar longitudinal mode {"£"), and 
vectorial modes ( "vx" , "vy" ) . The full metric perturbation then takes the form 
(see [13] and references therein) 

= h+e+j + /ix ef^. + hy,e^j + hfe\j -f /ivxe™ + h^yeJJ, (7) 

^In addition, there is a rotational degree of freedom tp around the axis defined by A, but 
without loss of generality we can take ■0 = 0. 
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Figure 1: In metric theories of gravity, up to 6 polarization states are allowed. At 
top left and right, we illustrate the transverse "+" and "x" tensor polarizations. 
At middle left, the transverse "breathing" mode is shown, and at middle right 
the longitudinal scalar mode. At bottom left and right, one has the vector 
modes. (Adapted from [2].) 



with 

e'' = e^'Siex + ey'^ey, (8) 

^/2e^(g)e,, (9) 

Ex® + e^® e-x, (10) 

ey®ez + ez®ey. (11) 



e 



The full response of an interferometer to a signal containing all these polarization 
states is 

h = F+h+ +Fy,hy,+ F^K + Fihe + F^^K^ + F^yh^y, (12) 

and 

i^b = -^sin=^6l cos2(^, (13) 

Fi = ^ sin^ 6* cos 20, (14) 
V 2 

Fvx = sin 261 COS 20, (15) 

Fvy = sin 6* sin 20. (16) 

Currently, observational constraints on additional polarization modes are 
limited. From the Hulse- Taylor double neutron star we know that the energy 
loss due to non-tensor emission must be less than 1% [HHS]- However, alter- 
native polarizations might show up in more weak-field regimes and after hav- 
ing propagated over distances much larger than the characteristic scale of the 
Hulse- Taylor binary. Alternatively, they might only appear in situations where 
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the source is far more relativistic, with high characteristic velocities vjc. In core 
collapse supernovae, radial velocities vjc ^ 0.25 are attained |36j . which may 
excite longitudinal modes. In the case of binary inspiral, v /c > 0.4 is reached 
before the final plunge, which (using Kepler's third law) corresponds to a grav- 
itational wave frequency / = c^{v/cY /{nGM), with M the total mass. For 
binary neutron stars with component masses (1.4, 1.4) this approximately 
equals 1600 Hz, which is in the sensitivity band of ground-based detectors. 

There are a number of alternative theories of gravity which predict non- 
standard polarization states. To name but a few: 

• Brans-Dicke theory is a scalar-tensor theory of gravity which has scalar 
modes [57] . 

• Scalar modes also occur in Kaluza-Klein theory, where our 4D world arises 
after compactification of one or more extra spatial dimensions j38j . 

• Certain brane world models, such as the Dvali-Gabadadze-Porrati model 
in the self-accelerating branch, have all six modes above [39] . 

A single interferometric detector would not suffice to disentangle all these 
polarization states. To see this, consider a breathing mode (see Fig. [T]) imping- 
ing upon a detector, coming from a direction that is perpendicular to the plane 
of the interferometer. Then both detector arms would get lengthened and short- 
ened in unison, but what an interferometer senses is the relative difference in arm 
length. Or, consider a breathing mode whose propagation direction corresponds 
to the orientation of one of the arms. Then this arm would be unaffected, while 
the other arm would still periodically get lenghtened and shortened, leading 
to a relative difference in arm lengths, which however would be indistinguish- 
able from the effect of a gravitational wave with "plus" polarization. Hence, a 
network of interferometers is called for. 

Consider D detectors at different positions on the Earth and whose noise 
is uncorrelated. A signal would reach the interferometers at different times. 
However, if one knew the sky position Cl, e.g. because of an electromagnetic 
counterpart to the gravitational wave signal as might be expected from a con- 
veniently oriented BNS or NSBH event [40], then one would know how to time 
shift the outputs of the detectors to analyze the signal at a fixed time, say 
the arrival time at the Earth's center. Since from now on we assume Vl to 
be known, we omit any explicit reference to it in expressions below. In each 
detector A= 1, . . . , Z?, the output will take the form 

d^{f) = h^{f) + n^U). (17) 



where each of the Ka takes the general form ( 12 ), and the ua represent the noise 
in each of the detectors. The overbar indicates that (a) we will be considering the 
Fourier transforms of the relevant quantities, which are functions of frequency 
/ rather than time t, and (b) the data streams have been divided by \/ SAif), 
with SA^f) the noise spectral density (basically the variance of the noise as a 
function of frequency) for detector A. The latter ensures that we will not have to 
worry about differences in design and operation between the various detectors. 



Eq. (17 1 can be expressed in terms of the beam pattern functions: 

d^{f)=F^ha{f)+n^{f), (18) 
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where a — 1, ... ,6 runs over the polarization states "+", "x", "b", "f, "vx", 
and "vy", and summation over repeated indices is assumed. The first term in 
the right hand side expresses the signal as a linear combination of five vectors 
in the D-dimension al sp ace of detector outputs: F+, Fx, Fvx, Fvy, and F^; 
indeed, from Eqns. (13), (14), it is clear that F^ = — -\/2Fb, so that one only 
has one independent vector pertaining to the scalar modes. Also note that the 
remaining vectors can be linearly independent only if D > 5. 

In general relativity, only the tensor modes /i+ and hx are present. Given 
three detectors (e.g. the two Advanced LIGOs and Advanced Virgo, which will 
be the first to take data) , a null stream can be constructed by eliminating these 
modes from the output vector d, following the original idea by Giirsel and Tinto 

m- 



null 



F+ AF> 
IF, AF> 



•d. 



(19) 



where F+ A Fx is the vector whose components in the space of detector outputs 
are 

^ABCpBpC^ (20) 

with ^Yie completely antisymmetric symbol, and here too summation over 

repeated indices is understood. It is not difficult to see that dGR,nuii can only 
contain non-standard polarizations; the tensor modes are projected out. This 
is illustrated in Fig. |2] Hence, if GR is correct, dcR.nuii should not contain a 
signal. If, on the other hand, one or more of the alternative polarizations /ib, 
hi, /ivx, ^vy are present, then they will show up as a statistical excess in the 
null stream. 




Figure 2: An illustration of the construction of the null stream dGR,nuii from a 
3-detector output. The vector of outputs d, and the beam pattern vectors F-|_ 
and Fx , live in a 3-dimensional space. The null stream is obtained by projecting 
d onto the unit normal to the plane determined by F^, Fx. The projection is 
guaranteed not to contain tensorial polarization modes. 



Sometime after 2017, the Japanese KAGRA will become active, and there 
will be four detectors, so that D = A. This will allow for the construction of 
two null streams which in addition to the tensor modes will also be devoid of 



8 



e.g. one of the two vector modes and one of the two scalar modes: 



(4),i _ F+ A Fx A Fvx -■ , , 

- |F+AFxAF. ' 



VX| 



(4)^2 _ F+ A Fx A F^ - 



where the wedge product is defined analogously to Eq. (20), but now using the 
4-dimensional antisymmetric symbol e^-^C"^. Note that for D = A, there can 
not be a third independent null stream which also excludes the tensor modes. 

Finally, around the end of the decade, IndlGO may also be taking data, so 
that D = 5. In that case three null streams can be constructed that exclude the 
tensor modes: 

(5) ,1 _ F+ AFx AFy^AFyy - 

dGR.„un - |F^^F^^F..^F.,| ^' ^^^> 

(5) .2 _ F+AFxAF..AF, 

- |F+AFxAF..AF,| ^' ^'^^ 
(5)^3 _ F+AFxAFyyAF, 

dGR..un - |F+AFxAF.,Af,| 

(26) 

If a theory that has scalar modes happens to be the right one, then there will 
be a signal in '^'rfcRnuii above. If there are vector modes, then they will show 

up in (^)d^R,„un and/or ^^^d^R^nun- 

Hayama and Nishizawa showed how to reconstruct the polarization modes in 
the case where the number of detectors is at least the number of modes, based 
on the null stream idea ^16) . As an illustration, they reconstructed a simulated 
longitudinal mode in Brans-Dicke theory. Such a mode might be emitted by a 
supernova explosion, in which radial velocities w/c ~ 0.25 are reached [36]. 

If a statistical excess is seen in one or more null streams, then one would 
like to match-filter them with template waveforms that allow for one or more 
alternative polarization states to obtain information about their physical con- 
tent. Such waveform models were developed in the context of the (extended) 
parameterized post-Einsteinian (ppE) framework by Chatziioannou et al. [15j . 
and we refer the reader to that paper for details. The original ppE framework 
will be discussed below. 



3 Probing gravitational self-interaction 

3.1 The regime of late inspiral 

Within GR, especially the inspiral part of the coalescence process has been 
modeled in great detail using the post-Newtonian (PN) formalism (see [T7] and 
references therein) , in which quantities such as the conserved energy and flux are 
found as expansions in v /c, where v{t) is a characteristic speed. During inspiral, 
the GW signals will carry a detailed imprint of the orbital motion. Indeed, the 
main contribution has a phase that is simply 2$(i), with <I>(i) the orbital phase. 
Thus, the angular motion of the binary is directly encoded in the waveform's 
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phase, and assuming quasi-circular inspiral, the radial motion follows from the 
instantaneous angular frequency ui(t) = ^(t) through the relativistic version 
of Kepler's Third Law. If there are deviations from GR, the different emission 
mechanism and/or differences in the orbital motion will be encoded in the phase 
of the signal waveform, allowing us to probe the strong-field dynamics of gravity. 

In this section, we shall employ the usual post-Newtonian notation, in which 
"qPN order", with q an integer or half-integer, refers to contributions at (v/c)'^'^ 
beyond leading order. 

Up to a reference phase, the orbital phase takes the form [42] 

In general relativity, the coefficients ipn and Lpn^ depend on the component 
masses mi, m2 and spins Si, S2 in a very specific way; these dependences are 
currently known up to n = 7. The different PN terms in the phasing formula 
arise from non-linear multipole interactions as the wave propagates from the 
source's "near zone", where gravitational fields are strong, to the "far zone", 
where detection takes place. Specifically, the physical content of some of the 
contributions is as follows: 

• ip3 and (^5 encode the interaction of the total (Arnowitt-Deser-Misner, or 
ADM IJ) mass-energy of the source with the quadrupole moment. The 
physical picture is that the quadrupolar waves scatter off the Schwarzschild 
curvature generated by the source. These contributions are referred to as 
gravitational wave "tails" . One of the early proposals towards testing non- 
linear aspects of general relativity using gravitational waves was due to 
Blanchet and Sathyaprakash, who first discussed the possibility of mea- 
suring these tail effects ^ . 

• Spin-orbit interactions also first make their appearance in if 3, and the 
lowest-order spin-spin interactions occur in ip^ |44j . 

• ipQ includes the cubic non-linear interactions in the scattering of gravita- 
tional waves due to the ADM mass-energy of the system [33] . 

Thus, observations of these PN contributions would allow for penetrating tests 
of the non-linear structure of general relativity. 

It is worth noting that with binary pulsars, one can only constrain the con- 
servative sector of the orbital dynamics to IPN order, and the dissipative sector 
to leading order; see, e.g., the discussion in [JS] and references therein. Hence, 
when it comes to ^{t), these observations do not fully constrain the IPN contri- 



bution. More generally, terms in ( 27 ) with n > are only accessible with direct 
gravitational wave detection. 

3.2 The parameterized post-Einsteinian formalism 

By now there is a large body of literature on alternative theories to general 
relativity, which will induce changes in the functional dependences of the ipn, 
ifn^ on component masses and spins, or even introduce new powers of v/c in 



the phase expression, Eq. (27). For instance. 
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The effect of a non-standard dispersion relation {e.g. due to a non-zero 
graviton mass) would accumulate over the large distances which the signal 
has to travel to reach the detector, and would be visible in (p2- Solar 
system dynamics bound the graviton's Compton wavelength as Xg > 10^^ 
km. Second-generation detectors will improve on this by a factor of a few; 
Einstein Telescope will probe Xg > 10^'' km, and LISA Xg > 10^^ km 

miiiiiiiiiiisniisiiiss]- 



Scalar-tensor theories add a term tpsr {v/c) to Eq. (27), due to dipolar 
emission. In Brans-Dicke theory, one has a dimensionless parameter wbd 
which leads to standard GR in the limit wbd oo. The Solar system 
bound from the Cassini spacecraft is wbd ^ 40000; LISA will improve on 
this by up to an order of magnitude 071 021 IS31 [Ml ISS] • 

A variable Newton constant adds a term ipQ^t) {v/c)~^^ [561 , and extra 
dimensions can also have this effect j57j. 

Quadratic curvature terms in the Lagrangian modify ip4 |58| . The same is 
true of dynamical Chern-Simons theory [59j . Here the second-generation 
detectors could place a bound on a dimensionful parameter of ^^^^ < 
0(10 — 100) km, six to seven orders of magnitude better than the solar 
system constraint (^^Z"* < 0(10^) km), and in this case also considerably 
better than LISA (^^/^ < 0(10^ - 10^) km)! 

Quadratic curvature terms arise in string theory compactifications [60) , and dy- 
namical Chern-Simons theory can be motivated both from string theory [61' and 
loop quantum gravity I62[ . and also arises in effective field theories of inflation 
[63j . Their effects on the phase at {v/c)"^ beyond leading order will only become 
visible when v/c is large. This is the regime we will be interested in here. 

Yunes and Pretorius established the so-called parameterized post-Einsteinian 
(ppE) framework as a way both to classify alternative theories of gravity, and 
to provide template waveforms to search for violations of GR with gravitational 
wave detectors jl3j. Their proposal involves both the phase and the amplitude 
of gravitational waves. However, since we are mostly concerned with second- 
generation detectors which for the expected stellar mass sources will not be very 
sensitive to changes in the amplitude [64| . we will focus on the phase. Instead 



of using the expression (27) for the inspiral phase, the authors of [13] proposed 
the following ansatz (again up to some reference phase) 



$(«) = E[0„-f0(')ln0](^)". (28) 
Here, the 5„ and 0„, are meant to be completely free parameters. The 



above phase reduces to the one predicted by GR, Eq. (27), for 6„ — —5, —4 



and when the phase coefficients have the standard dependences on component 
masses and spins: = (/J„(mi,m2, S'l, 52), = (pn\'mi,m2, Si, S2). Yunes 
and Pretorius also showed how a variety of alternative theories of gravity in the 
literature can be obtained by making appropriate choices for the 6„ and 0„. 



^The original proposal of |13| was formulated in the frequency domain and omitted loga- 
rithmic contributions, but the basic idea is the same. 
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Now, in the case of second-generation detectors, the form (28) may not be the 



most convenient one as far as data analysis is concerned. Indeed, even in the 
presence of a pure GR signal and using trial waveforms with the above phase, 
probability distributions arising from measurements of 6„ and 0„ might peak 
at the correct values for very high SNRs, but probably not for signals at the 
threshold of detectability and in the presence of a considerable amount of noise, 
as is expected for most detections in second-generation observatories. 

Yunes and others calculated the phase for a great variety of alternative 
theories, and in each case found the 6„ to be integer; see the examples and 
references above. It then makes sense to write 



N 



i'Mn 



(29) 



where we let the leading-order term be (v/c) ^ to allow for dipolar emission. 
This time only the and are free parameters. 

If there are too many free parameters to be determined, the measurement 
accuracy of all of the parameters will be ad ver sely affected, and we would still 
like to reduce the number of free 0„, (pif' in (29). Alternative theories that have 
a non-zero n = —2 contribution to the phase, such as scalar-tensor theories, can 
already be fairly well constrained using the electromagnetically observed binary 
pulsars With direct gravitational wave detection, the regime where we will 
be the most sensitive to GR violations is the one where v/c is large, which is 
out of reach for other observational methods. Hence we are mostly interested 
in new contributions to the phase with a power of v/c greater than or equal to 
—5. For this reason, below we will set (t>_r> = oI>-i = 0. 



3.3 A generic test of general relativity with inspiraling 
compact binaries: the TIGER method 

In probing the strong-field dynamics, one would like to be sensitive to almost 
any departure from general relativity, also through mechanisms that have yet 
to be envisaged. Hence what is needed is a test of GR that is as generic as 
possible. The possibility of such a test was first put forward by Arun et al. in 
[l8] , and the idea is illustrated in Fig. [sj If for simplicity we assume that the 
component objects have zero spins, then the GR values of the coefficients 0„, 



bn^ in Eq. (|29|) only depend on the component masses (rni,m2). Hence only 



two of them are independent, and tests of GR could be performed by comparing 
any three of them and checking for consistency]^ 

In practice, it is more convenient to make use of Bayesian inference. This 
involves the comparison of two hypotheses, namely the GR hypothesis HgRj 
and "HmodGR which posits that GR is violated. In the present context, Hgr 
will be the hypothesis that the (pn^ depend on both masses and spins in 
the standard way. Ideally, "HmodGR would be the negation of "HmodGR; but this 
is impossible in principle to evaluate, as one cannot check the observed phase 



^Needless to say, this does not mean that a completely generic test of GR is possible. In 
this picture, in principle there could be a GR violation which somehow still causes the error 
bands of any triplet of phasing coefficients to have a common region of overlap, but at the 
same, wrong component masses. See also the discussion in [65] . 
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Figure 3: A schematic illustration of how one might set up a very generic test 
of GR [TS] . The plots show the regions in the plane of the component masses 
(mi, corresponding to the 1-a measurement uncertainties on the coefficients 
(01, ip2,<f>3)- Left: If GR is correct, there will be a common region of overlap at 
the true values of the masses (here 10 and 4 Mq). Right: if there is a deviation 
from GR and one or more of the (pn do not have the dependences on masses 
that GR predicts, then there will be a mismatch. 



against all possible phase models that deviate from the GR family. Instead, 
we need to base our "HmodGR on a phase which allows for a finite-dimensional 
family of deviations. 

Inspired by [TS], we define "Hgr and "HmodGR as follows [201 121] ■ 

• ^GR is the hypothesis that all the </)„, (pn'' have the functional dependence 
on component masses and spins as predicted by GR. 

• ^modGR, is the hypothesis that one or more of the (f>n, 4>n^ (without spec- 
ifying which) do not have this functional dependence, but all others do. 

Given a detected inspiral signal in a stretch of data rf, the question is now how 
these hypotheses are to be evaluated. 

Suppose we would like to compare two hypotheses Ha and Tis- First, on 
each of them we can apply Bayes' theorem [66 . For instance, for "Hai 

P{HA\d,I)^ • (30) 

Here P('HaM, I) is the posterior probability of the hypothesis Ha given the data 
d and whatever additional information / we may hold, P(Ha|^) is the prior 
probability of the hypothesis, and P{d\'HAT I) is the evidence for Ha, which can 
be written as 

Pid\HA,I)^ J d0p{d\HA,O,I)piO\HA,I). (31) 

In this expression, p(0|Hyi,/) is the prior probability density of the unknown 
parameter vector within the model corresponding to Ha, and p{d\HA, Q, I) is 
the likelihood function for the observation d, assuming the model Ha and given 
values of the parameters 6. 

The function p{d\HA, Q, I) is what can be computed from the data. Let us 
assume that Ha corresponds to a particular gravitational wave signal model. 
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hA{6\t). In the output of a gravitational wave detector d{t), the signal will be 
combined with detector noise n{t): 

d{t)^n{t) + hA{0;t). (32) 

Let us assume that the noise is stationary and Gaussian; then its probability 
density distribution can be written as 

p[n] ^AAe-("l")/^ (33) 

where the square brackets in the left hand side indicate that p[n\ is a functional 
of n, and A/" is a normalization factor. The inner product ( • | • ) is defined as 
follows: 

«*(/)&(/) 



ia\b)=A^I -^^jff^^ (34) 







with a(/), b{f) the Fourier transforms of functions a{t), b{t). The quantity S'„(/) 



is called the noise power spectral density; comparing with Eq. (33), we see that 



it is essentially the variance of the noise as a function of frequency. Eqns. (32 1, 



(33), and (34) motivate the following form for the likelihood p{d\'HA,0,I): 

p(d|-HA,^^,/) -AAe-('^-''-^(«"')l^-''-^(«"'»/2. (35) 

Indeed, when subtracting the signal from the detector output, the expectation 
is that only stationary, Gaussian noise remains. 



Using Eq. ( 30 ) for both Ha and Hb , one can construct an odds ratio 

A _ Pmidj) ^ Pj-HMi) P{d\nAj) , . 

p{nB\dj) p{nB\i)P{d\HB.iy ^ ' 

where P{'Ha\I) / P{T~1.b\I) is the prior odds of the two hypotheses, i.e. the rela- 
tive confidence we assign to the models before any observation has taken place. 
The ratio of evidences is called the Bayes factor, which can be computed from 



the data by using Eqns. (31 ) and (|35| for hypotheses T-La and T-Lb 



._p{d\nA,i) 



- p{d\nB,iy ^''^ 



In the present context, the odds ratio of interest is 



^modGR _ -P("^modGRMi I) _ -P('HmodGR.|-^) J'(c?|"HmodGRi I) 

"^^^ " p(HgrM,/) " p(Hgr|/) P{d\nGK,i) ■ ^ ' 

The evidence P(o?|'Hgr, -^) is computed by considering a large number of GR 
waveforms with different parameters 9 to map out the likelihood function p(d|HGR, I), 



Eq. (35), which is then substituted into Eq. (31). However, the way "HmodGR 



is formulated, there is no waveform family associated with it, as there is no 
waveform model in which "one or more" of the 0„, are different from their 
GR predictions. 

To address this issue, we introduce the following auxiliary hypotheses^ 

^With minor abuse of notation, from now on wc let {(f>o, (t>2, ■ ■ ■ , 4>m} be the set of M = 10 
coefficients that are currently known from post-Newtonian calculations, including tfie "loga- 
rithmic" ones diii\ 
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Hi^i2...ik is the hypothesis that the phasing coefficients (j)i^,4>i^, . . . , 4>i^ 
do not have the functional dependence on masses and spins as pre- 
dicted by GR, but all other coefficients 0j, j ^ {ii,i2i ■ ■ ■ ,ik} do 
have the dependence as in GR. 

Thus, for example, H12 is the hypothesis that and 02 deviate from their 
GR values, but all other coefficients are as in GR. With each of the hypotheses 
above, we can associate a waveform model that can be used to test it. Let 
9 = {mi, TO2, 5*1, 5'2, . . .} be the parameters occurring in the GR waveform, 
where mi, m2 are the component masses and Si, S2 the component spins; 
other parameters include the orientation of the orbital plane with respect to the 
line of sight, sky position, and distance. Then -ffijij - ifc is tested by a waveform 
in which the independent parameters are 

{6*, (39) 

i.e. the coefficients {^ij, 4'i2, ■ ■ ■ , ipik} allowed to vary freely in addition to 
the other parameters. 

The hypothesis we are really interested in is HmodGR above, which posits 
that one or more of the (pi differ from their GR values, without specifying which. 
But this corresponds to the logical "or" of the auxiliary hypotheses: 

'HmodGR = \/ -ffiii2---jfc- (40) 

il<i2<---<ik;k<NT 

Note that in practice, it will not be possible for computational reasons to con- 
sider all possible subsets of even the 10 known phasing coefficients; hence we 
limit ourselves to the subsets of {(pi, (p2, ■ ■ ■ , 4>Nt}j where Nt < 10 is mainly set 
by computational resources. We will call the latter our testing coefficients. 
To illustrate how the auxiliary hypotheses allow us to compute the odds 



ratio CG^f™ of Eq. (381, let us consider the case of just two testing coefficients, 
{01, (/)2}- Then 

"HmodGR fflVi/2Vi/i2, (41) 

and the odds ratio becomes 

imodGR , 

p{HiyH2yHi2\d,i) 
p{nGR\d,i) 



QmodGR ^ ^ ^ ,7 ,r ■ (42) 



Now, the hypotheses Hi, H2, and H12 are logically disjoint: the "and" of any 
two of them is false. Indeed, in Hi, (p2 takes the GR value, but in H2 it differs 
from it, as it does in H12. Similarly, in H2, (pi takes the GR value, but it differs 
from it in Hi and in -ffi2. This implies 

P{Hi V i72 V Hi2\d, I) = P{Hi\d, I) + P{H2\d, I) + P{Hi2\d, I) (43) 

and hence 

^modGR ^ P{Hl\dJ) , P{H2\d,I) P{Hi2\d,I) 

p(HgrM,/) ^p(HGRM,/) p(HgrM,/)- ^ ^ 



Using Bayes' theorem (30) on each term, we get 



^n.odGR_ Pmi) „i P{H2\I) 2 I P{Hi2\I) 12 

" P(Hgr|/)^^^ + P(Hgr|/)^°^ + P(Hgr|/)^°^' ^^^^ 
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where the Bayes factors Bq^, Sqj^, and are given by 

1 P{d\H,,I) 2 P{d\H,,I) 12 _ P{d\H,^,I) 

P{d\Hon,iy ^«« "P(d|HGR,/)' P{d\HoK,iy ^ 

These can be computed from the data, as explained in the discussion leading 
up to Eq. ( [37| . However, a choice will have to be made for the relative prior 

oddsP(i/i|/)/P(HGR|/), P{H2\I)/PinGR\I), andP(i7i2|/)/P(HGR|/). If one 
believed that the graviton has mass, then a deviation in 02 would be the thing 
to look for, and the auxiliary hypothesis H2 should have more weight than either 
Hi or H12. On the other hand, one's favorite alternative theory might predict a 
violation in (pi instead, in which case Hi should have more prior weight. Or, one 
might expect a GR violation to affect all phasing coefficients at the same time, 
so that H12 is a priori preferred. The method presented here is meant to find 
generic deviations from GR, with no preference for any particular alternative 
theory; consequently, we set 

P{Hi\I) _ P{H2\I) _ P{Hi2\I) ^^^^ 



P{nGR\I) P{nGn\I) P(Hgr|/)' 

We will also need to specify the overall prior odds for HmodGR against Hgr- 
Here we simply set 

PCHmodGKlI) ^ PjHiy H^y Hi2\I) ^ 

P(Hgr|/) PiHGRlI) ^ ' 

where the constant a will end up being an unimportant overall scaling factor in 



the odds ratio. Eqns. (47), (48) imply 



P{Hi\I) _ P{H2\I) _ P{Hi2\I) _ a ^^g^ 



P(Hgr|/) P(Hgr|/) P(Hgr|/) 3' 

and, except for the overall factor a, the final expression for the odds ratio reduces 
to a straightforward average of the Bayes factors for the auxiliary hypotheses 
against GR: 

OgodGR ^ ""f-^^^i)^ - i [^^H + Bl,^ + ] . (50) 

Thus, although there is no waveform model with which to directly test the hy- 
pothesis "HniodGRi thanks to the auxiliary hypotheses it is nevertheless possible 
to compute its posterior probability relative to that of GR. 

In the above example we used only two testing parameters, but in practice 
one will want to have more. With Nt testing parameters {(f>i, ■ ■ ■ ,0Ary} and 



making similar choices to Eqns. (47 1, (48), the odds ratio will again be propor- 



tional to an average of the Bayes factors for the auxiliary hypotheses against 
GR gU]: 

Nt 



-imodGR _ ^ 

- 1 

k=l ii<i2<---<ik 



OSr""=2iv^E E ^GR^-"- (51) 



Combining data from multiple observed inspiral events will make for a far 
more robust test of GR compared to using just one detection. Suppose one has 
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M independent detections in stretches of data di, ^2, . . . , dj\f. Assuming these to 
be independent, it is not difficult to show that the odds ratio for the "catalog" 
of detections as a whole takes the form [20] 

^/^modGR _ -P("^modGRMli ^^2, ■ ■ ■ , , I) 
«^ " P(HGR|rfl,d2,...,dAA,/) 

Nt M 

= E (52) 

k=l ii<i2<---<ik A=l 

i.e., for each auxiliary hypothesis, one multiplies together all the Bayes factors 
against GR for individual sources, ('4)^^^2 - »fc ^ after which one takes the average 
over all these hypotheses. 

The algorithm described here was developed by Li et al. [lUl HI] . It has 
been dubbed the TIGER method ( "Test Infrastructure for GEneral Relativity" ) , 
and a hands-on data analysis pipeline for use on the upcoming detections in 
Advanced LIGO and Virgo data has been developed based on this idea. It has 
a number of benefits: 

• Unlike previous Bayesian treatments such as [IHIISI], it addresses the ques- 
tion "Do one or more testing parameters deviate from their GR values?" , 
as opposed to "Do all of them deviate?" . Bayesian analysis naturally in- 
cludes the idea of Occam's Razor in a quantitative way, and if the full 
non-GR model happens to have too many free parameters then one will 
be penalized for it [66] , 

• It is well-suited to a situation where most sources are near the threshold 
of detect ability. As shown in [20], if a GR violation is small, the Bayes 
factor for the "correct" auxiliary hypothesis (if any) will not always make 
the largest contribution to the odds ratio, as detector noise can obfuscate 
the precise nature of the GR violation. Even then, the GR hypothesis will 
typically be disfavored compared to one or more of the other auxiliary 
hypotheses, causing the GR violation to be detected after all. 

• In combining information from multiple sources, it is not necessary that a 
GR violation manifests itself in the same way from one source to another. 
A deviation from GR could depend on mass, and on whatever additional 
charges might be present in the correct theory of gravity. However, in the 
above, the same "yes/no" question is asked for every source, and evidence 
for or against GR is allowed to build up as more and more sources get 
added. 

• The method is not restricted to just the inspiral phase. It could equally 
well be applied to ringdown (as discussed below), or for that matter to 
alternative polarization states. All that is needed is a convenient param- 
eterization of possible deviations from GR, such as provided by (general- 
izations of) the parameterized post-Einsteinian formalism. 

3.4 Accuracy in probing the strong-field dynamics with 
second-generation detectors 

Let us consider some examples to gauge how sensitive the TIGER method will be 
for particular (though heuristic) violations of GR, with the network of Advanced 
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LIGO and Virgo detectors. In order to do this, one can produce simulated sta- 
tionary, Gaussian detector noise, whose power spectral density (essentially the 
variance of the noise as a function of frequency) is in accordance with predictions 
for the Advanced LIGO and Virgo interferometers in their final configurations, 
projected for the 2019-2021 time frame [HIIZ]- Simulated signals can be added 
to this simulated noise. 

First we consider binary neutron stars. For such sources, the inspiral signal 
ends at high frequencies, and to good approximation one can assume that only 
this part of the coalescence process is visible in the frequency band where the 
detectors are sensitive. Moreover, in BNS systems the dimensionless intrinsic 
spins of the components are expected to be small: cJ/{Gw?) < 0.05 [S7], with 
J the spin and m the component mass. Finally, for most of the inspiral, neutron 
stars can be treated as point particles; finite size and matter effects will only 
be important at relatively high frequencies where the detectors are not very 
sensitive |68j . Thus, binary neutron star inspirals are relatively clean systems 
whose GW emission can be described by fairly simple waveform models [55] . 
Indeed, a hands-on data analysis pipeline which starts from raw detector data 
and computes Ogr^^^ already been developed. 

To have a fair assessment of how the method will perform with second- 
generation detectors, the simulated BNS sources will have to be distributed in 
an astrophysically realistic way. We will assume the component masses to be 
uniform in the interval [1,2]Mq. The normal to the inspiral plane, and the 
sky position, are taken from a uniform distribution on the sphere, and sources 
are distributed uniformly in volume. Distances are between 100 and 400 Mpc; 
the former is the distance within which one can realistically expect one inspiral 
event every two years, and the latter is approximately the largest distance at 
which an optimally oriented and positioned system is still visible with Advanced 
LIGO [F. Finally, many simulated catalogs of 15 sources each are produced. 

We will be interested in GR violations that affect contributions to the phase 
(29) with n > 0; as mentioned before, we expect novel effects to show up for 
large v/c. Therefore, let us choose as testing coefficients the set (/)2, (/la}, 
leading to 2'^ — 1 = 7 auxiliary hypotheses that need to be compared with Hgr 
in order to compute the odds ratio Cqr'^'^^^ - 

A priori, one would expect O^"^^^ > 1 if GR is violated, and C'gr'^^^ < 1 
if it is correct. However, detector noise can mimick GR violations, so that 
occasionally one will have Cqr*^^ > 1 even when GR is in fact correct. To 
deal with this, one can compute odds ratios for a large number of catalogs of 
simulated sources whose emission is in accordance with GR, but having different 
parameter values within the above ranges, and see how the odds ratio ends up 
being distributed. For a given kind of GR violation, one can similarly construct 
an odds ratio distribution for catalogs of simulated sources. If the non-GR 
distribution has significant overlap with the GR distribution, then the particular 
GR violation considered would not necessarily be detected with great confidence. 
If, on the other hand, the two distributions are perfectly separated then the GR 
violation will be incontrovertibly detectable. 

As we have seen, non-linearities related to "tail" effects first show up at 
1.5PN order, i.e. in (/)3, so that this contribution is of particular interest. In 
order to gauge how sensitive the method might be to GR violations at this 
order, in [50] a constant relative shift in 03 was considered: (f)^ = (1 -l-^xs) 0^^- 
This was compared with the GR case, and it was found that for 5x3, = 0.1, 
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there is complete separation between the odds ratio distributions for the GR 
and non-GR catalogs, so that a violation of this kind and size would certainly 
be discovered. This is shown in the top panel of Fig. |4j 

What if a deviation from GR does not manifest itself as simple shifts in the 
phase coefHcients? Naively one might think that a more general phase model 



as in Eq. ( 28 ) would then be needed to uncover the GR violation. To show that 



this is not the case, Li et al. [21 considered a heuristic violation of the form 

-6+M/Mq 



(«)^<i>G«(t;) + ^ (-) , (53) 

with $'^'^(ti) the GR phase. The prefactor /3 was chosen to be of the same order 
as the (pn predicted by GR (see [H] for details), and M is the total mass, so that 
the power of v/c in the extra term varies from effectively 0.5PN to effectively 
1.5PN within the BNS mass range considered. However, in order to compute 



Oqp'^'^'^ , the phase model used was still that of Eq. (|29|) with integer n, and the 
testing parameters were {4>i,4'2,4'3}j as before. As shown in the bottom panel 
of Fig. |4j also in this eventuality the GR hypothesis Hgr will be disfavored 
compared with one or more of the Hi-^^i^,,,i^. The separation between GR and 
non-GR source catalogs is complete. The odds ratio C'gr'^'^^^ indeed provides a 
Bayesian realization of the basic idea sketched in Fig. [3j inspired by Arun et al. 



3.5 Binary neutron stars versus binary black holes 

As mentioned before, in the case of binary neutron stars, it is mostly only the 
inspiral part that is within the sensitivity band of the detector, so that we do not 
have to worry about the messy merger process. Finite size and matter effects 
mostly appear at high frequencies, where they have little impact. Neutron stars 
in binaries are expected to be relatively slowly spinning, and also this aspect can 
be dealt with. Consequently, a simple waveform model can be used for which 
data analysis algorithms are sufficiently fast, and a full data analysis pipeline 
for testing GR with BNS signals is already in place. 

The situation is quite different for binary black holes. The frequency at 
which the inspiral terminates is roug hly cV(6^/^7rGM), with M the total mass. 
For a BBH with component masses of (10, 1O)M0, this is approximately 220 
Hz, close to the frequency of ^ 150 Hz where the detectors will be the most 
sensitive. Thus, the merger part of the signal, which is still not well modeled, 
will play a major role. Moreover, astrophysical black holes are expected to be 
fast-spinning, with dimensionless spins cJ/ (Gm^) — 0.3 — 0.99 [7^. If the spins 
are not aligned with each other and the orbital angular momentum, then all 
three of these vectors will undergo precession during the inspiral phase |71l I72j . 
Since the unit vector L in the direction of orbital angular momentum is also 
the unit normal to the inspiral plane, the latter will undergo precession, and 
in extreme cases even a tumbling motion. This behavior is imprinted onto 
the gravitational wave emission through modulation of both the amplitude and 
the frequency of the waveform, as shown in Fig. [5j The rich dynamics that is 
unleashed in this way makes binary black holes far more interesting systems to 
study, but the more complicated signals also make the data analysis problem a 
great deal more difficult. 
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Figure 4: Top: Distributions of log odds ratios for many catalogs of 15 BNS 
sources each, with and without a 10% shift in (/13, the 1.5PN phasing coefficient 
which contains the leading-order non-linearities of GR related to tail effects [20] . 
The blue, dotted distribution is for sources with pure GR emission; we see that 
mostly InOgj^'^*^^ < 0, although noise will occasionally mimick a GR violation, 
causing the tail towards positive In Ogg^^*^^ . The red, dashed distribution is 
for sources with the 10% shift at 1.5PN. The two distributions are perfectly 
separated, indicating that a violation of this type and magnitude will easily be 
detected. Bottom: The same for a violation which does not manifest itself as 
a simple shift in one or more of the phasing coefficients (see the main text for 
details), but Cq^*^^^ is still computed in exactly the same way as before [H]. 
Here too, there is complete separation between GR and non-GR. 
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Figure 5: For systems with no spins, the frequency and amphtude of gravita- 
tional waveforms increases in a steady "chirp" (top). If there are significant 
spins which are not aligned with each other and the orbital angular momen- 
tum, then one has precession of the orbital plane, causing modulation of the 
amplitude and frequency. (Courtesy B.S. Sathyaprakash.) 

Large-scale numerical simulations provide us with accurate waveform models 
[73], but they take a long time to compute and can not be used in data analysis, 
where many thousands of trial waveforms need to be compared with the data 
to arrive at accurate parameter estimation. On the other hand, semi-analytic 
inspiral-merger-ringdown waveforms are under construction, which roughly fall 
into two categories. In the Effective One- Body formalism, the inspiral includes 
part of the final plunge, and a ringdown waveform can be "stitched" to it; the 
waveform as a whole can then be further "tuned" against numerical results [71] . 
There is also a variety of phenomenological waveform models which are similarly 
improved using numerical predictions [75l [76l [77] . These are achieving matches 
> 0.99 with numerically predicted signals; however, so far the only inspiral- 
merger-ringdown waveform with fully precessing spins is the one of Refs. [77], 
which has been tuned against only a limited number of numerical waveforms. 

Because of these difficulties, TIGER can not yet be extended for use on BBH 
signals. However, the authors of [SO] [21] made some rough estimates of what 
might be achievable once appropriate waveform models are available. Fig. [6] 
shows the log odds ratio distributions for catalogs of simulated BBH signals 
with GR emission, and with a 0.5% shift in (f>Q, which encodes the cubic non- 
linear interactions of the scattering of gravitational wave tails by the ADM 
mass-energy. In both cases, the inspiral-merger-ringdown waveform model used 
was the aligned-spin approximant of [76], and the set of testing parameters 
consisted of (the analogs of) only {</>i, ^('2, ^3, </'4}|^ Hence the the coefficient 
containing the deviation was not among the testing parameters. However, even 
though both Hgr and the are inconsistent with the signal, the waveform 

models of the latter have more freedom and can arrive at a closer fit, causing the 

®Note that the number of auxihary hypotheses grows with the number of testing 

parameters Nt as — 1, and the data analysis problem can become computationally very 
costly if too many are used. 
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GR hypothesis to be disfavored. And indeed, there is near-complete separation 
between GR and non-GR! On the other hand, precessing spins are bound to 
affect these results, in a way that is as yet unknown. 
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Figure 6: An estimate by the authors of [20l EI] of how well one would be able 
to probe deviations from GR at high post-Newtonian order with binary black 
holes. As before, the blue dotted histogram is for catalogs of GR sources. The 
red dashed histogram is for sources with a 0.5% deviation at 3PN order, where 
cubic non-linear self-interaction of the gravitational field appears. The testing 
coefficients were {(/>i, ^2, ^3, '/'4} and hence did not include the parameter (j)Q 
where the GR violation actually occurs; nevertheless, there is near-complete 
separation between GR and non-GR. 



4 Testing the no hair theorem 

In Newtonian theory, the gravitational potential $ caused by a body with den- 
sity p satisfies 

= AnGp in the interior, (54) 
= in the exterior. (55) 

In the exterior, $ can be expanded as 

= -GE7St^™(^''^)' (56) 

and the multipole moments are obtained by demanding consistency be- 

tween the interior and exterior solutions. For axially symmetric objects, only 
terms with m = contribute. The lowest-order multipole, Mqo, is just the total 
mass of the body. By appropriately choosing the center of the coordinate sys- 
tem used, one can set Afio — 0. The next non-trivial multipole moment is M20, 
the quadrupolc moment, which has dimensions ML^. The set of all multipole 
moments uniquely determines the shape of the potential $, and by measuring 
them one can study the properties of the mass distribution that gives rise to it. 
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In general relativity, the spacetimes outside bodies can similarly be described 
by a set of multipole moments. Spacetimes that are stationary, axisymmetric, 
reflection symmetric across the equatorial plane, and asymptotically flat - an 
example being the Kerr black hole - are characterized by two sets of multipole 
moments: mass multipole moments Mq, M2, M4, and current (or spin) 
multipole moments Si, S3, S5, ... . Mq = M is the mass, Si = J is the spin, 
and M2 is the mass quadrupole moment. Now, according to the no hair theorem 
[22], the multipole moments of quiescent black holes with the above properties 
satisfy 

Mi+iSi = M{iay, (57) 

where a = J/M. Hence only two of them are independent: a quiescent black 
hole can be characterized completely by its mass M and spin J. Measuring any 
three of the multipole moments and checking consistency with the above relation 
would constitute a test of general relativity. The Mi and Si have dimensions 
of (mass)'"*"^, and it is convenient to instead use dimensionless quantities m; = 
Mi/M'-+^ and S; = Si/M'+^, as we shall do below. 



4.1 Ringdown 

At the end of inspiral, binary neutron stars or black holes plunge towards each 
other to form a single, highly excited black hole, which will then undergo "ring- 
down" as it evolves to a quiescent, Kerr black hole. This process can be modeled 
as perturbations around a Kerr background, subject to the Einstein equations. 
For a black hole with mass M at a distance D, the "plus" and "cross" polariza- 
tions then take the form of damped sinusoids, the quasi-normal modes (QNMs) 

mU 

M 

= — ^Aj™r|"e-*/^"" cos(c.,„t-m(/.), (58) 
M 

hy.{t) = -— ^Ai„y^"e-*/^"" sin(t^i,„t-TO0), (59) 

l.m 

with (/) a phase offset, and F^™ are linear combinations of spin- weighted 

spherical harmonics -2^''" |80) . 

r["(0 = _2i"'"(i,o) + (-i)'_2y'-™(^,o), (60) 

Yirii) = _2i"'"(t,0)-(-l)'-2r'-™(^,0), (61) 

with L the angle between the direction of the black hole's intrinsic angular mo- 
mentum and the line of sight to the observer. 



The damping times and mode frequencies r/™ {M, J) , w/™ (M, J) in Eqns. ( 58 ) , 



( 59 1 only depend on the black hole mass M and its spin J pTj . Hence, in general 
relativity only two of the Tim and ujim are independent, which opens up the pos- 
sibility of a test of GR, similar to the one described above for the case of inspiral. 
This would effectively be a test of the no hair theorem. Indeed, the reason why 
frequencies and damping times only depend on these two quantities is that (a) 



Strictly speaking there is another integer n characterizing the polarizations 179!. However, 
overtones with ra ^ have small amplitudes and short damping times; we will not consider 
them here. 
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the background spacetime around which one considers perturbations is assumed 
to be Kerr, and (b) the perturbative Einstein equations are assumed vaHd on 
this spacetime background, forcing relationships between damping frequencies 
and times. 

Gossan, Veitch, and Sathyaprakash [21] demonstrated how one can exploit 
the interdependences of the r/m, ujim to test GR with Einstein Telescope as well 
as space-based detectors. For simplicity, they assumed that the spins of the 
progenitor objects were zero, in which case the amplitudes Aim in ( 58 ) , ( 59 1 only 
depend on the symmetric mass ratio r] = mim2 / (rni + m2)^. Using data from 
numerical simulations in [82] . they arrived at an analytic fit for the amplitudes 
^21 J ^22 J ^33: and A44 of the four most dominant modes as a function of 77, 
and the damping times tI^{M, J) and QNM frequencies uj^^{M, J) predicted 
by GR were modeled using simple analytic fits from Berti et al. [7S]- It was 
then assumed that the true damping times Tim and LOim might deviate from the 
GR prediction by dimensionless relative shifts Af;„i and Aa);m, respectively: 



Tim = il + Afim)T^^M,J), 
UJlm = [1 + l^UJlm)0J?m{M.J). 



(62) 
(63) 



where in the case of GR, Af;„i — Aw/,,, — for all / and m. The full parameter 
space for the "deviating" model "Hdcv was then 



AJ)/m, 6*}, 



(64) 



with 6 = {M, J, . . .} the parameters of the GR waveform. In practice, only a 
limited number of frequencies and damping times were allowed to be non-zero, 
leading to a parameter space 



{Aii22, Af22, Acjas,^}. 



(65) 



With the second-generation detectors, it is unlikely that much more than 
the 22 mode will be distinguishable. As shown by Kamaretsos et al. [HI], the 
situation is quite different for Einstein Telescope or a space-based detector such 
as LISA, where the 21, 22, 33, and 44 modes can all contribute signicantly to the 
signal-to-noise ratio. Numerical experiments were performed in which simulated 
signals were added to stationary, Gaussian noise following the projected noise 
power spectral densities of Einstein Telescope and LISA. The sensitivity to GR 



violations of the type (62), (63) was then checked by two methods: 



• Direct parameter estimation. Given data d and the signal model _Hdcvj 
the posterior probability distribution for the parameters A of Eq. ( 64 ) is 
given by 

pidlTidcv, A, I)p{X\ndcv,I) 



p{X\d,ndcv,I) - 

p{d\ndcv,I) 

where we have used Bayes' theorem. One has 

pidindcv,!) = / dXp{d\ndcv,X,I)p{X\'Hdcv,I) 



(66) 



(67) 



with p{X\'Hdcv, I) the prior distribution of the parameters. p(d|Hdcv, A, /) 
is the likelihood of the data given parameters A, as in Eq. ( 35 ) : 



p(d|Hdev, A, /) = AAe-('^-''(^)l'^-'^(^))/^ 



(68) 
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where /i(A; t) is the waveform family corresponding to Hdev Posterior dis- 
tributions for parameters like Af;„j and Aw/™ are obtained by integrating 



the posterior probability density ( 66 ) over all the other parameters 



• Model selection. Here two models were considered: the GR model "Hgr 
in which Afjm = Aa);^ = and only the 9 are free parameters, and the 
"deviating" model T^dov in which the Afim and Auiim are allowed to vary 
on top of the 9. One then computes an odds ratio 

P(Hdcv|rf,/) 

P{nGR\I) PidinonJ)' ^ ' 

The ratio of prior probabilities, PiHdcvlI) / PiHoRlI) , is just a constant 
overall prefactor, which for convenience can be set to one. 

With parameter estimation, it was found that a 10% deviation in 0^22 would 
be clearly visible for a 500 Mq black hole at 1.25 Gpc in ET, and for 10^ Mq 
and 10^ Mq at 1.25 Gpc and 10 Gpc, respectively, in LISA. 500 coalescences 
are expected to be rare within distances of 1.25 Gpc. By contrast, the quoted 
mass and distance range for LISA is expected to correspond to a detection rate 
of tens per year [83 j . 

Bayesian model selection leads to rather better results. A 10% deviation in 
W22 would be visible for 500 Mq at Dl — 6 Gpc in Einstein Telescope, and for 
10^ Mq at a similar distance with LISA. A 0.6% deviation could be picked up 
at a redshift of 5 with a 10^ Mq source in LISA. 

The above results are for black holes resulting from a binary with non- 
spinning components. However, in [25], it was shown that in the case where both 
components have spins, the ringdown mode amplitudes retain a memory 
not only of the progenitor's mass ratio, but also of spins. 



We end this subsection with an important comment. In the odds ratio (70 1, 



the hypothesis Udev is not the equivalent of our HmodGR the case of inspiral 



(Sec. 3.3). To see this, denote the "testing parameters" by 

{(f)l, (1)2,(1)3) = (Ac2;22, Af22, Ad)33), (71) 

where in this case, (j)i ~ for i = 1,2,3 corresponds to GR. In the language 
of TIGER, one then has Hdcv = H123, the hypothesis in which all three pa- 
rameters at the same time are different from the GR prediction. Indeed, in 
the "deviating" waveform model, all of the testing parameters are allowed to 
vary freely, but e.g. each of the hypersurfaces (j)i = have zero measure in the 
model's parameter space, and zero prior probability. In this notation and with 
the prior odds for 'Hdcv against Hgr set to some arbitrary a, 

O^R=«i?GR- (72) 

The resolvability of anomalies in the testing parameters would no doubt improve 
if instead one were to compute an odds ratio 

^modGR _ PCHnwdGRjd, I) , ^ 

" P{noK\d,i) ^^^^ 

3 

a \ ^ \ - 



lE E ^gr". (74) 



23 . 

A:— 1 ii<...<i 
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completely analogously to ( 51 ), possibly with a larger number of testing param- 
eters than just {Adj22, Af22, Awas}. It would also be of great interest to see 
what happens if information from multiple sources is combined, 



/-/-jmodGR, 



Pin 



modGR 



\di,. . . ,d^f,I) 



P(HGRMl,...,rfAA,/) 



(75) 



analogously to (52 1 



4.2 Extreme mass ratio inspirals 

Extreme mass ratio inspirals (EMRIs) consist of a very massive black hole (or 
boson star [51], or naked singularity, ...) surrounded by a smaller object, which 
could be a neutron star or a stellar mass black hole. In the case of Einstein 
Telescope, target systems would have a massive component of a few hundred 
solar masses [85], while for space-based detectors the mass would be in the 
range 10^ — 10^ Mq [55] . EMRIs provide another avenue to testing the no hair 
theorem: the orbits are expected to be extremely complicated, and in the case 
of LISA, there will be a large number of gravitational wave cycles within the 
detector's frequency band. As a consequence, the gravitational wave emission 
of the smaller object will bear a detailed imprint of the spacetime in the vicinity 
of the massive object. 

Ryan was the first to evaluate the measurability of multipole moments using 
EMRI signals in Advanced LIGO and LISA [53] . With a number of simplifying 
assumptions - the most important one being circularity of the orbit of the 
smaller object, which moreover is taken to move in the equatorial plane - one can 
write down an expression for the phase in the Fourier domain explicitly showing 
the dependence on the multipoles rri/, S;. In particular, Si first appears at 1.5PN 
order, and m2 at 2PN. In the case of Advanced LIGO, Ryan's conclusion was 
that even assuming mi = 30 Mq (a very heavy stellar mass black hole) and 
m2 — 0.2 (an implausibly light neutron star), it would be hard to independently 
measure Si and Tn2 with a single inspiral event near SNR threshold. For LISA, 
the situation is quite different. Assuming toi = 10^ Mq, m2 — 10 Mq, and SNR 
= 100, one obtains l-cr measurement accuracies of Asi ^ 10~^ and Am2 ^ 
1.5 X 10~^. This would allow for a precision test of the no hair theorem. 

Subsequent to Ryan's seminal paper, a number of authors have relaxed his 
assumptions. Collins and Hughes developed a treatment of multipole moments 
that is more appropriate than Ryan's in the strong-field regime |87j through the 
notion of "bumpy black holes" . The motion of the smaller object will not be 
quasi-circular; Glampedakis and Babak employed so-called kludge waveforms 
which encode the essentials of the orbital motion [55]. Barack and Cutler [5^ 
showed that with kludge waveforms, results for the measurability of low-order 
multipole moments are qualitatively in keeping with those of Ryan. Vigeland 
and Hughes studied orbits around "bumpy black holes" , showing how a space- 
time's bumps are imprinted onto the orbital frequencies [50] • Recently, Vigeland, 
Yunes and Stein studied bumpy black holes in alternative theories of gravity [5T] . 

Most recently, Rodriguez et al. performed a more in-depth study of the pos- 
sibility of using second-generation detectors to test the no hair theorem, also 
assuming a more reasonable mass for the lighter object (1.4 Mq), and masses 
between 10 Mq and 150 Mq for the heavier one [HI]. This work still mostly con- 
sidered parameter estimation; it would be of great interest to cast the problem 
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in terms of hypothesis testing, in which case results from multiple sources could 
be combined. 



5 Probing the large-scale structure of spacetime 
5.1 Binary inspirals as standard sirens 

Assuming that at large scales the Universe is homogeneous and isotropic, its 
line element can be put in the Friedman-Lemaitre-Robertson- Walker (FLRW) 
form [1]: 



dr^ 



1 — fcr^ 



(76) 



where the entire dynamical content resides in the evolution of the scale fac- 
tor a{t). The constant k can be positive, zero, or negative, in which case the 
t = const spatial hypersurfaces are hyperspherical, flat, or hyperboloidal, respec- 
tively. Given a homogeneous mass distribution p with pressure P, the Einstein 
equations reduce to two equations for a(i), p{t), and P{t), called the Friedman 
equations. 



Stt 



47r 



k 



(P + 3P) 



(77) 
(78) 



which can be combined to arrive at an equation for the time evolution of the 
density: 

p^-3{p + P)^. (79) 

This can be solved given an equation of state P = P{p). In the case of pres- 
sureless dust (which can serve as a model for a sprinkling of galaxies), P = 0, 
and p oc . For radiation, P — p/3, leading to p oc a~^. Finally, there is 
evidence that the expansion of the Universe is speeding up j35[ . The cause is 
unknown, but it is convenient to model it as dark energy, a perfect fluid with 
positive density but negative pressure. Postulating an equation of state of the 
form P{t) — w{t) p{t) with w{t) < 0, one can once again solve Eq. (79 1 to obtain 
an expression for p as a function of the scale factor a. 



Using this and the first Friedman equation ( 77 1 , the way the Universe evolves 



depending on its contents can be expressed through the Hubble parameter H{a), 
defined as 



(80) 



ftyio, + ^na + ilkO- " + ^^DE exp 3 



" da! 



:i + w(a')] 



(81) 



where iJo is the Hubble constant, which gives the expansion of the Universe 
at the current epoch, and the dimensionless quantities JIm, ^^r, ^fc, and riDE 
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are the fractional contributions to the total density of, respectively, matter, 
radiation, spatial curvature, and dark energy: 



M 



Stt 



Pm,o, 



Stt 
3^ 



Pr,o, 



DE 



Stt 



PDE.O, (82) 



with pm,Oi Pr,Oi Pde,o the densities at the current epoch of matter, radiation, 
and dark energy, respectively. 

An important task in cosmology is to determine 



(83) 



and especially to gain empirical insight into the enigmatic dark energy, by mea- 
suring its equation-of-state parameter w{t). The main tools for studying the 
late-time evolution of the Universe are standard candles. These are distance 
markers for which both the redshift z and the luminosity distance Z?l are 
known. For a source with intrinsic luminosity C and observed flux Di^ is 
defined through 

If the Universe were Euclidean and never-changing, Dl would correspond to 
the familiar Euclidean notion of distance. However, due to the evolution of the 
Universe, Dl and z are related in a complicated way: 



|fcrV2 3in[|A:|i/2/^-^ 

DUz) = c{i + z) <; 



for rife < 0, 
for rife = 0, 
for fife > 0, 



(85) 



where H{z) is the Hubble parameter as a function of redshift. Since radiation 
will not be very important at late times, one can write (using 1/a = 1 -I- z) 



H{z) - Ho [fluil + zf + r!fc(l + zf + (1 - riM - ^k) E{z)\ 



1/2 



(86) 



where E{z) depends on the equation of state of dark energy. At late times, one 
can expand w(t), or equivalently w(z), as 



w{z) 



Pde/pde = Wo + Wa{l ~a)+0[{l- af 



(87) 



Wo + Wa 



l + z' 



in which case 



E{z) = (1 + z)3(l+«'o+^„) g-3^„./(l+^)_ 



(89) 



From Eqns. (85 1 and ( p6| , it will be clear that given a large number of astro- 
physical sources for which the pairs (Dl, z) can be measured, one can constrain 
the parameters ( 83 ) . 

The most commonly used standard candles are Type la supernovae, whose 
luminosity is believed to be known within ~ 10% |35j . However, this luminosity 
needs to be calibrated by comparison with different kinds of closer-by sources, 
leading to a "cosmic distance ladder" , each rung of which could contain unknown 
systematic errors. As pointed out by Schutz in 1986, GW signals from inspiraling 
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neutron stars and black holes can provide an absolute measure of distance, with 
no dependence on other sources In the context of cosmology, they have 

been dubbed standard sirens. To see how this works, consider the amplitude of 
an inspiral signal as a function of time: 

At) = -^M^/^g{9, i, V) uj^'\t). (90) 

Here M = {mim2)^^^ / {mi + 7712)^^^ is the chirp mass, g{6,(f>, l,^) is a known 
function of the sky position {9,(f)) and orientation of the orbital plane {l,iP), 
and ui{t) = <i>(t) is the instantaneous frequency. The chirp mass, and of course 
Lo(t), can be obtained from the phase. Thus, if sky position and orientation are 
known, then from the amplitude one can infer the luminosity distance Dl- 



5.2 Cosmography with gravitational wave detectors 

To make use of binary inspirals as standard sirens, what is needed is a way to 
obtain some information about redshift z, and also about the sky position (0, 0) 
and orientation (t, tp) so that the luminosity distance Z^l can be obtained from 
the GW amplitude. A variety of methods have been proposed to achieve this. 



5.2.1 Using electromagnetic counterparts 

Gamma ray bursts (GRBs) are among the most energetic electromagnetic events 
since the Big Bang. They roughly fall into two categories: short, hard GRBs and 
long, soft ones. It is believed (although only direct GW detection will provide 
a definitive answer) that short, hard GRBs are caused by the coalescence of 
two neutron stars, or a neutron star and a black hole [40 . Sometimes a GRB 
can be localized on the sky, providing {9, (/)). If this allows for the identification 
of the galaxy that was host to the inspiral event, then from its spectrum one 
can infer the redshift z. Finally, given a network of detectors, some information 
about the orientation (i, 0) can also be obtained. Additionally, it is possible that 
GRBs are strongly beamed in a direction perpendicular to the inspiral plane, 
with inclination angle l < 20°. 

In j27| , Nissanke et al. investigated with what accuracy a network of ad- 
vanced detectors would be able to do cosmology. We first note that with second- 
generation detectors, the maximum redshift out to which inspirals can be seen 
is z ~ 0.1 for BNS and z ~ 0.2 for NSBH. For small redshifts, the luminosity 



distance- redshift relationship, Eq. (85), reduces to 



Di.^^, (91) 

which is just Bubble's law. This means that with advanced detectors, we will 
only be able to probe the Hubble constant i/o. However, since Hq is an overall 
scaling factor in the full expression for Z?l, its accurate and unbiased measure- 
ment is key to precision cosmology at the largest scales. Gravitational wave 
detection will provide us with a way of measuring Hq without having to rely on 



any kind of cosmic distance ladder. Note that from Eq. (91 1, if redshift can be 
determined with essentially zero uncertainty, then the uncertainty AiJo on the 
Hubble parameter is related to the distance uncertainty by 

^^^^ 
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This pertains to a single source; the accuracy will improve roughly as ^ \/J7 
for Af events. Nissanke et al. found that with a network composed of the two 
Advanced LIGOs and Advanced Virgo, with Af — 4 BNS events one already has 
AHo/Ho ~ 13%, and with Af = 15, AHq/Hq ~ 5% [27j. 

With Einstein Telescope it would be possible to see BNS events out to red- 
shifts of several. In |30j and |3I| , detailed studies were made of how accurately 



the full set of cosmological parameters (83) could be measured. With 0(1000) 
events with identifiable host galaxies over the course of 5-10 years, f^M and 
i^DE could be constrained with an uncertainties comparable to what one finds 
in measurements of the Cosmic Microwave Background (CMB). One can also 
use the CMB measurements for JIm, ^^dEj and fl^,, and their uncertainties, as 
priors, and focus on the dark energy equation of state parameter w, including its 



possible time dependence. Using a linear approximation to 'w{z) as in Eq. (88 1, 
one can then compare accuracies in measuring wq and Wa, on the one hand 
using standard sirens seen by ET, and on the other hand considering the SNAP 
Type la supernova survey which may be available on the same timescale as ET 
[55] . The results are shown in Fig. [7j The measurement quality is comparable 
in the two cases, but we stress once again that standard sirens allow for an 
independent measurement, with no need for a cosmic distance ladder. 
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Figure 7: Measurement uncertainties for the possible time dependence in the 
dark energy equation of state parameter w, modeled as w{a) = wq + {I — 
ajWa, with a the scale factor '3T]. The slightly larger, red error ellipse is for 
standard sirens as seen with ET, the blue one for the possible future SNAP 
Type la supernova survey |93j . In both cases, prior information from the CMB 
is assumed for JIm, ^^de, and Qk- 



5.2.2 Using a prior on the intrinsic neutron star masses 

Currently there are about 10 electromagnetically observed binary neutron star 
systems, with varying degrees of compactness. The distribution of neutron star 
masses in these binaries is relatively tight [94], with mean /ins — 1-34 and 
standard deviation ctns — 0.06 M©. Now, the masses that are measured from 
a gravitational wave signal are not the physical ones mphys, but the redshifted 
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masses mobs; for a source at redshift z, one has 

mobs = (1 + z) mphys- (93) 

As shown by Taylor, Gair, and Mandel, assuming an underlying, physical dis- 
tribution of masses and comparing with the observed masses, one can obtain 
information about the redshifts of events without ever needing an electromag- 
netic counterpart [5H]. With ^ 100 BNS observations, a network of second- 
generation detectors would then allow the measurement of the Hubble constant 
with ^ 10% uncertainty. 

Taylor and Gair applied this idea to a network of Einstein Telescopes, in 
which case one might have as many as lO'^ BNS signals per year [32]. Keeping 
Hq, S^DE; arid Qk fixed, they found that also with this method, the dark 



energy equation of state parameters {wq, Wa) in Eq. ( 88 ) could be measured with 
an accuracy comparable to the forecasted constraints from future SNIa surveys 
with CMB and other results as priors. 



5.2.3 Using galaxy catalogs 

Another exciting idea for measuring Hq without the need for electromagnetic 
counterparts, and without having to restrict to a particular kind of inspiral 
event, was recently put forward by Del Pozzo He assumed three kinds of 
networks: the Advanced LIGO detectors together with Advanced Virgo (HLV), 
the same with the Japanese KAGRA added (HLVJ), and the five-detector net- 
work with IndlGO also included (HLVJI). Given an inspiral event, these net- 
works will be able to localize it on the sky with different uncertainties; similarly, 
the distance i?L extracted from the gravitational wave signal will also be subject 
to errors. Combining these uncertainties, one obtains a large 3-dimensional box 
in which the inspiral event could have occurred. A galaxy catalog will yield a 
list of potential host galaxies within this box, all having different redshifts Zi. 
Using the Hubble law Z?l = cz/Hq, the maximum-likelihood distance Dl, to- 
gether with this list of redshifts, leads to a list of possible values for the Hubble 
constant, {Hq i}. However, a second inspiral event will yield another list {Hq j}, 
which will typically have only limited overlap with the first one. As more and 
more detections are made, the true value of the Hubble constant will quickly 
emerge. Del Pozzo cast this idea into the language of Bayesian analysis, and 
found that after only 10 observations, the 95% confidence level accuracy on Hq 
is 14.5%, 7%, and 6.7% for the HLV, HLVJ, and HLVJI networks, respectively. 
After 50 observations, these numbers become 5%, 2% and 1.8%, respectively; 
see also Fig. [8j 

The advantage of this method is that it does not rely on any specific kind of 
source - in principle it can use all BNS, NSBH, and BBH detections. One issue 
will be the completeness of galaxy catalogs. However, it is possible to include in 
the analysis terms that describe the likelihood of observing a GW whose host 
galaxy was not detected by any survey because of its faintness; see [55] . 

MacLeod and Hogan explored a similar idea for measuring Hq in the context 
of LISA, with GW signals from EMRIs, and using galaxy clustering [3T. Finally, 
a supermassive binary black hole at z < 1 would be sufficiently localizable with 
LISA that one might be able to find the host galaxy cluster, which would then 
yield a redshift, allowing for a measurement of the equation-of-state parameter 
of dark energy to within a few percent |95) . 
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Figure 8: Evolution of the medians and 95% confidence intervals for the Hub- 
ble constant as information from an increasing number of coalescence events 
is combined, using a galaxy catalog to obtain information on redshifts, for the 
HLVJI network of second-generation detectors [25] , 



5.2.4 Using the neutron star equation of state to extract redshift 
from the gravitational v^rave signal 

Recently, a method was developed to extract redshift information from the GW 
signal itself, at least in the case of BNS or NSBH. In the last stages of inspiral, 
a neutron star will be deformed and acquire a quadrupole moment Qij due 
to the tidal field £ij of the companion object, and to leading order one can 



write Qi 



-XSij. Here A is the tidal deformability parameter, which depends 



both on the neutron star mass and the equation of state. These deformations 
have an influence on the orbital motion, which in turn gets imprinted onto 
the gravitational waveform. Such effects appear in the phase at 5PN and 6PN 
orders: 

^ $pp(w) + $tidal(f), (94) 

where $pp is the post-Newtonian phase under the assumption of point particles, 

and mm 
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(3179 -919xa- 2286x^ + 260 xi^) 



(95) 



where the sum is over the components of the binary, Xa = fria/M, and Aa — 
X{ma), a = 1,2. The function A(m) takes the form A(m) = (2/3) fc2 i?^(m), 
with /c2 the second Love number and R{m) a neutron star's radius as a function 
of mass. Note that A(m) enters Eq. (95) only in the combination 



A(m) 
M5 



10^ 



(96) 
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Thus, although the tidal terms only appear at very high post-Newtonian order, 
they come with a large prefactor. Their effect will be noticeable already in 
advanced detectors, and certainly in Einstein Telescope. 

Messenger and Read noted that the tidal contribution to the phase ( 95 1 only 
depends on intrinsic quantities |97j . Indeed, the expansion of the Universe as 
the signal travels from source to observer will cause the "observed" radius and 
mass to both be larger than the physical ones by a factor (l + z), which however 



will cancel from ( 96 1 and hence (|95j) . 

Einstein Telescope might see 0(10^) BNS sources. Some fraction of these 
could be used to determine the neutron star equation of state by measuring 
A(m). Once this is done, for each source in the other fraction one would be 
able to determine the observed masses mobs = (1 + ^) Wphys from the low-order 
PN contributions to the phase, and the intrinsic masses TOphys from the tidal 
contribution ( 95 1 . Hence both luminosity distances Dl and redshifts z can be 
inferred directly from the gravitational wave signals! 

This will again allow for a fit of the luminosity distance-red shif t relation 
Di^{z), thus constraining the cosmological parameters (} of Eq. (83 1, on con- 
dition that the uncertainties on redshift measurements are not too high. The 
latter depend on the equation of state, about which not much is currently known. 
Messenger and Read estimate that in the range z = 0.1 — 1, Az/z ~ 0.1 for 
the "hardest" predicted equations of state (implying the greatest deformabil- 
ity), and Az/z ^ 0.4 for "soft" equations of state. How this translates into 
constraints on the parameters n is yet to be investigated. 



6 Summary 

In a few years' time, the second-generation gravitational wave detectors are due 
to deliver their first detections. This will herald a new era in the empirical study 
of gravitation. For the first time, we will have access to the genuinely strong- 
field dynamics of gravity. As a bonus, we will be able to look at weak-field 
gravity in a novel way, by searching for effects that may only show up at very 
large distance scales, such as the ones which gravitational waves must travel 
from source to observer. 

In older literature, studies of how well one might test GR with gravita- 
tional waves mostly took the form of estimates. With the advanced detector 
era approaching, the last few years have seen the development of hands-on data 
analysis pipelines to look for deviations from GR in actual detector data. Soon 
we will be searching for alternative polarization states, as well as for possible 
anomalies in the way that dynamical gravitational fields interact with them- 
selves. For the latter, a full data analysis pipeline using coalescences of binary 
neutron stars is already in place. Binary black holes have a much richer dy- 
namics, but the added complexity also makes for a formidable data analysis 
problem, the exploration of which has only just begun. 

With third-generation ground-based detectors such as Einstein Telescope, 
and the space-based LISA, one would be able to exploit not just the inspiral 
phase, but also the ringdown. As with the phasing coefficients, the Einstein 
equations impose relationships between characteristic frequencies and damping 
times, which effectively allow for a test of the no hair theorem. This will com- 
plement the test proposed earlier by Ryan, using extreme mass ratio inspirals. 
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Binary inspirals arc "standard sirens" which can be used to probe the large 
scale structure of spacetime. Although the basic idea had already been pro- 
posed by Schutz as early as 1986, the last few years have seen the development 
of detailed methods, based on electromagnetic counterparts, exploiting the mass 
distribution of binary neutron stars, using galaxy catalogs, or employing knowl- 
edge of the neutron star equation of state. 

The coming years arc almost guaranteed to be a bonanza for gravitational 
physics. Either general relativity will be confirmed with more stringent tests 
than any that have been performed hitherto, or we will see deviations, which 
may well take the form of low-energy limits of quantum gravity effects. Any 
which way, the prospects are exciting indeed. 
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